Shadowing Synthesized Speech - Segmental Analysis of Phonetic Convergence

نویسندگان

  • Iona Gessinger
  • Eran Raveh
  • Sébastien Le Maguer
  • Bernd Möbius
  • Ingmar Steiner
چکیده

To shed light on the question whether humans converge phonetically to synthesized speech, a shadowing experiment was conducted using three different types of stimuli – natural speaker, diphone synthesis, and HMM synthesis. Three segment-level phonetic features of German that are well-known to vary across native speakers were examined. The first feature triggered convergence in roughly one third of the cases for all stimulus types. The second feature showed generally a small amount of convergence, which may be due to the nature of the feature itself. Still the effect was strongest for the natural stimuli, followed by the HMM stimuli and weakest for the diphone stimuli. The effect of the third feature was clearly observable for the natural stimuli and less pronounced in the synthetic stimuli. This is presumably a result of the partly insufficient perceptibility of this target feature in the synthetic stimuli and demonstrates the necessity of gaining fine-grained control over the synthesis output, should it be intended to implement capabilities of phonetic convergence on the segmental level in spoken dialogue systems.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Effect of Fundamental Frequency on Phonetic Convergence

This paper examines the importance of fundamental frequency (F0) in the process of phonetic convergence within an immediate-repetition or “shadowing” task. Previous research has suggested that F0 facilitates the transmission of social information that individuals can use to establish their social orientation within an interaction (Gregory et al., 1991, 1996, 2001). Social theories of accommodat...

متن کامل

Phonetic convergence in shadowed speech: a comparison of perceptual and acoustic measures

Phonetic convergence is highly variable across studies, measures, and analyses. The current paper describes a study that examined multiple acoustic measures in concert with a perceptual measure of phonetic convergence. The study employed a shadowing task in which multiple talkers shadowed words from a set of models. Across different scales of analysis, the acoustic measures were highly variable...

متن کامل

How much imitation is there in a shadowing task?

Phonetic imitation, also called phonetic convergence, is currently at the heart of numerous investigations since it can inform us on both the nature of lexical representations and the link between production and perception processes in spoken language communication. A task that has been largely used to study phonetic imitation is the shadowing task, in which participants merely listen to and re...

متن کامل

Prediction and imitation in speech

It has been suggested that intra- and inter-speaker variability in speech are correlated. Interlocutors have been shown to converge on various phonetic dimensions. In addition, speakers imitate the phonetic properties of voices they are exposed to in shadowing, repetition, and even passive listening tasks. We review three theoretical accounts of speech imitation and convergence phenomena: (i) t...

متن کامل

Proceedings of Meetings on Acoustics

Phonetic convergence occurs both when individuals interact in conversation, and when listeners rapidly repeat words presented over headphones. Results from multiple studies examining phonetic convergence offer an array of often confusing and disparate findings. Reconciling such diverse findings is difficult without a clear rationale for engaging in one acoustic measure over another. The current...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017